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Abstract. The assertion that there is an intrinsic excess of binaries with mass ratios q ~ 1 - the twin hypothesis - 
is investigated. A strong version of this hypothesis (H s ), due to Lucy & Ricco (1979) and Tokovinin (2000), refers 
to a narrow peak in the distribution function ip(q) for q <; 0.95. A weak version (TC W ), due to Halbwachs et al. 
(2003), refers to a broad peak for q <; 0.8. Current data on SB2's is analysed and H s is found to be statistically 
significant for a sample restricted to orbits of high precision. But claims that TL S is significant for binaries with 
special characteristics are not confirmed since the sample sizes are well below the minimum required for a reliable 
test. With regard to TL W , additional observational evidence is not presented, but evidence to the contrary in 
the form of Hogeveen's (1992b) model of biased sampling with %j) oc q~ 2 is criticized. Specifically, his success in 
thus fitting catalogued data depends on implausible assumptions about the research methodologies of binary-star 
spectroscopists. 
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1. Introduction 

Statistical studies of binaries are hindered by selection ef- 
fects. For example, because of the steep mass-luminosity 
relation for main-sequence stars, the discovery of double- 
lined spectroscopic binaries (SB2's) favours systems with 
mass ratios q ~ 1. However, notwithstanding this bias, 
some authors have argued that there is an intrinsic ex- 
cess of binaries with components having similar masses. 
This claim may conveniently be referred to as the twin hy- 
pothesis. Moreover, there are strong (Tt s ) and weak (TC W ) 
versions of this hypothesis. 

According to Halbwachs et al. (2003), an excess oc- 
curs for q <; 0.8, and they refer to such binaries as twins 
(Tt w ). This claim results from their finding that ip(q), the 
probability density function (pdf) of the q's of unevolved 
binaries, has a positive slope for q <; 0.75. These authors 
also review earlier determinations of tp(q), in particular 
those claiming a bimodal pdf with one peak at q ~ 1. 

On the other hand, Tokovinin (2000, hereafter TkOO) 
refers to binaries with q > 0.95 as twins (Ti. s ). This is 
motivated by the claims of Lucy & Ricco (1979, hereafter 
LR79) and TkOO that there is significant excess of binaries 
with components having almost identical masses. 

Note that 7i s and 7i w are not mutually exclusive, nor 
does one imply the other: either or both could be true or 
false. 

If true, H s offers the tantalizing prospect of a deci- 
sive test of binary formation theories, which then stand or 
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fall according to whether or not they predict a near delta 
function at q ~ 1. But if TL S is false, ip(q) is a smooth and 
relatively featureless function, and so binary-formation 
simulations might have to sample the (unknown) distri- 
bution of pre-binary configurations in order to critically 
test formation theories An analogy with spectroscopy is 
instructive here: the discovery of just one weak emission 
line superposed on a featureless continuum dramatically 
enhances the prospects of theoretical interpretation. 

Because of the potential importance of Ti s , this paper 
adopts a different approach to its investigation from those 
of LR79 and TkOO as well as re-evaluating claims based 
on small samples. In addition, Hogeveen's (1992b) finding 
of a negatively-sloped ip(q), which would invalidate both 
TL S and H. w , is examined in detail. 

Throughout this paper, with the exception of Sect. 2. 5, 
binary data is discussed on the assumption that ip(q) is 
independent of binary mass and angular momentum - see 
LR79, Sect.I. 



2. The strong twin hypothesis 

In this section, evidence for Ti s is sought from current data 
on SB2's, and the significance of this evidence is quanti- 
fied. 



2.1. Measurement errors 

A major part of LR79 was devoted to correcting the q- 
distribution of SB2's for measurement errors. The start- 
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ing point of that investigation was the conjecture that the 
preference for q ~ 1 suggested by such classical binaries 
as Y Cyg and YY Gem was less evident for the totality 
of catalogued SB2's because of the inclusion of less pre- 
cise orbits. Accordingly, LR79 went back to the discov- 
ery papers and derived or computed the standard errors 
(T q . This excercise revealed that systems with a q <; 0.05 
were not uncommon, and so any narrow peak at q ~ 1 
would inevitably be softened by measurement errors. The 
error-free pdf was then estimated using a version of the 
Richardson-Lucy (R-L) deconvolution technique for data 
sets where each measurement q has a different a q . The re- 
sulting tp(q) has a sharp peak at q ~ 0.97, accounting for 
about half of binaries in the interval (0.95,1.0) - see Fig. 
3 in LR79. 

The use of the R-L technique by LR79 was later crit- 
icized by Hogeveen (1992a). He argues that the peak at 
q ~ 0.97 is an artefact of the deconvolution procedure. But 
TkOO's confirmation of Tt s cannot be similarly challenged 
since it does not rely on deconvolution. 

2.2. SB2 sample from the 9th catalogue 

The SB2's investigated in LR79 were drawn from the 6th 
catalogue of the orbital elements of spectroscopic binary 
systems. The current greatly expanded version is the 9th 
catalogue (Pourbaix et al. 2004), and this will be the 
source of data for this investigation - specifically, the up- 
dated online version as at 15.04.05. 

In order to re-investigate the effect of errors on the 
^-distribution, all SB2's were selected for which the com- 
ponents' semi-amplitudes K\ t 2 and their standard errors 
uk x 2 are recorded. Then, with q defined to be < 1, all sys- 
tems with q > 0.84 were selected, and their a q 's computed 
from the propagation-of-errors formula, 
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Fig. 1. Distribution of standard errors a q for 211 SB2's with 
q > 0.84. The vertical dashed line divides the data into high- 
and low-precision samples, labelled Si and S2, respectively. 
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The result is a sample of 211 systems, with a q ranging 
from 0.0012 to 0.19. A histogram showing the distribution 
of the a q 's is given in Fig.l. - 

This data is now divided into a high precision sample 
(Si) comprising 102 systems with a q < 0.01 and a low 

precision sample (S 2 ) comprising 109 systems with a q > Fig. 2. Mass ratio distributions for high- and low-precision 
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0.01. For each of these samples, the q's are counted into 
overlapping bins of width 0.02 with centres separated by 
0.01. The resulting counts N together with \/N error bars 
are plotted in Fig. 2. 

Fig 2 supports the basic premise of LR79 that there 
is a narrow peak at q ~ 1 that is partially obscured by 
measurement errors. With a q < 0.01, negligible error- 
broadening occurs for Si, and the plot shows marked ex- 
cesses in the bins centred at 0.99 and 0.98 - i.e., for the 
interval (0.97,1.0). In contrast, for S2, where significant 
broadening is inevitable, no peak is evident. 

Note that this dependence of the ^-distribution on pre- 
cision, as posited by LR79, is absent from the simulations 



samples. The filled circles are for 102 binaries with a q < 0.01, 
and the open circles for 109 systems with a q > 0.01. Error bars 
arc ±VN. 



of Hogeveen (1992a, Sect. 5). Instead of deriving the a q s 
from the discovery papers as in LR79 and here, he as- 
signs random artificial errors a q and so effectively his q- 
distributions are impervious to degradation by measure- 
ment errors. His simulations therefore miss the essential 
effect of the high information content from high precision 
orbits being imposed on the less precise data. This same 
effect is striking when the R-L algorithm is applied to the 
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problem of co-adding images with different psf's (Lucy 
1991). 

The peak at q ~ 1 for Si indicates, in agreement with 
TkOO, that H s need not now rely for its support on a 
deconvolution procedure as in LR79. 

2.3. Tests of significance 

The error bars in Fig. 2 give a rough indication of the sig- 
nificance of the differences between Si and S 2 . To quantify 
this significance, we now test the null hypothesis Hq that 
the two samples are drawn form the same parent popula- 
tion. 

The simplest test is to construct a 2 x 2 contingency 
table. To do this, the two samples (i = 1,2) are themselves 
divided at q = qd into low- and high-q subsamples, labelled 
j = 1 and j = 2, respectively. 

Taking the hint provided by Fig 2, we take qd = 0.97. 
The resulting counts are Nu = 62, N12 = 40, N 2 i = 
93,^22 = 16. The expected values on Ho are: nu = 74.9, 
7112 = 27.1, n 2 i =80.1, n 22 = 28.9, resulting in x 2 = 16.3 
with 1 degree of freedom - see, e.g., Press et al. (1992, 
p.624). The probability on H that \ 2 > 16.3 is p = 5.5 x 
10~ 5 . Accordingly, we reject H and conclude that the 
differences between Si and S2 are highly significant. 

This test can be repeated with different values of qd- 
With qd = 0.95, x 2 = 14-2, corresponding to p = 1.6 x 
10 -4 , a negligible loss of significance. But with qd — 0.94 
or lower, the peak is increasingly diluted by the back- 
ground, and the significance of the rejection of H drops 
sharply. This indicates that the lower limit of the peak 
associated with H s occurs at q ~ 0.95, in agreement with 
Fig. 3 in LR79. 

Another approach to quantifying the significance of 
differences between Si and £2 is the Kolmogorov-Smirnov 
(K-S) test, which has the merit of not requiring binning. 
But here, because the differences are at extreme end of the 
range - i.e., at q ~ 1, we follow Press et al. (1992, p. 620) 
and use the K-S variant due to N.H.Kuiper. Applying the 
Kuiper test to the ordered q's in Si and S2, we find that 
Kuiper's statistic V = D + + D- = 0.33, and the proba- 
bility on H that V > 0.33 is p = 2.7 x 10~ 4 , confirming 
highly significant differences. 

Because Kuiper's variant is not as familiar as the orig- 
inal K-S test, a Monte Carlo version has also been car- 
ried out. With tti — initially, the following simulation 
is carried out M times : from the 211 values of q, 102 
are randomly selected for assignment to Si, with the re- 
mainder assigned to S 2 . The Kuiper statistic V is then 
computed from the ordered g's in Si and S 2 , and we set 
m = m+1 if V > 0.33. Now, by construction, Si and S 2 
are drawn from the same parent population, and so Hq 
is true. Accordingly, at the conclusion of the experiment, 
m/M is an unbiased estimate of the probability on H that 
V > 0.33. An experiment with M = 10 7 gives m = 2302, 
so that p ~ 2.3x 10~ 4 , in satisfactory agreement with the 
estimate given by Kuiper's asymptotic theory. 
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Fig. 3. Error rates as function of sample size N when testing 
Hs- The samples comprise binaries with q > 0.84 and o q < 
0.01 and are constructed by bootstrap resampling from the 
102 binaries in Si. The percentage rates are plotted for the 1 
and 5 percent levels of significance. 



2.4. Minimum sample sizes 

The above tests show that a sample of 102 binaries with 
q > 0.84 and u q < 0.01 suffices to claim the discovery of a 
significant peak at q ~ 1 and thus support TL S . But if N, 
the number of binaries with accurate orbits, were reduced, 
the peak's significance would drop and the support for H. s 
would weaken. Thus, as N decreases, there is an increasing 
chance of a false negative - i.e. of rejecting H s even though 
it is true. 

This risk can be crudely quantified with a bootstrap 
technique. The following calulation is repeated numerous 
times as a function of TV: a sample Si of N binaries with 
<jq < 0.01 is created, each binary being randomly selected 
from Si - i.e., sampling with replacement. Kuiper's test is 
then applied to compare Si with S2, and the probability 
p computed that the Kuiper statistic could be exceeded if 
Ho is true. From these, we can derive the percentages of 
the samples Si for which H is not rejected at the 1 and 5 
percent levels of significance. These error rates are plotted 
as a function of N in Fig. 3. 

A minimum sample size N min can be defined as being 
such that for TV < N min there is a > 10 percent risk of 
failing to confirm H s even though it is true. From Fig. 
3, we find that N min ~ 79 and ~ 54 for the 1 and 5 
percent levels of significance, respectively. These results 
confirm and quantify the remark in TkOO that H s cannot 
be detected in small samples. 



4 



Lucy: The twin hypothesis 



2.5. Limiting period 

From the data then available, LR79 concluded that H s is 
no longer significant for binary periods P > 25 days. The 
existence of a cut-off period Pl was confirmed by TkOO 
and Halbwachs et al (2003) but revised upwards to 40 
and 50 days, repectively. 

The Kuiper test can also be used to estimate Pl from 
the samples Si and 52- The calculation proceeds as fol- 
lows: a sample S\ is created from Si by excluding binaries 
with P > P*. Kuiper's test is then applied to compare Si 
and S2 and the probability p computed that these samples 
are drawn from the same parent population. 

Such calculations show that excluding long period sys- 
tems indeed increases the significance of H s . A plot of p 
against P* shows a broad minimum with p ~ 2.5 x 10 -5 for 
P* in the interval (24, 43) days, with p increasing sharply 
for longer periods. Thus Pl — 43 days is the best estimate 
of the period above which the data markedly dilutes the 
significance of H s . To within errors, this agrees with the 
estimates of TkOO and Halbwachs et al. (2003). No doubt, 
with increased data, this cut-off period will be found to 
depend on binary mass. 

3. Special samples 

In this section, we evaluate claims that H s receives sup- 
port from binaries with special characteristics. 

3.1. BY Draconis binaries 

In LR79, data for 9 binaries among the 17 BY Dra vari- 
ables known at that time was cited as providing strong 
support for H s . Subsequently, at the suggestion of an 
anonymous referee, Fckcl et al. (1988) re-examined the 
evidence for a peak at q ~ 1 on the basis of an enlarged 
sample of 20 BY Dra binaries. It turns out that the ad- 
ditional binaries fill in the interval (0.80,0.95), resulting 
in a roughly uniform distribution from 0.80 to 1.0, and so 
Fekel et al. (1988) correctly report that there is no longer 
evidence for a sharp peak at q ~ 1. Nevertheless, in view 
of the small sample size, it does not follow that H s must 
be rejected for the BY Dra binaries. 

Of the 20 binaries in the Fekel et al. sample, 8 fall in the 
interval (0.84, 1.0) and only 3 have o q < 0.01. But even if 
all the orbits were precise, the sample of 8 is well below the 
minima derived in Sect. 2.4, and so there is a considerable 
risk of rejecting H s even if it is true. Specifically, for TV = 
8, Fig. 3 tells us that there is a 76 percent chance of failing 
to confirm H s at the 5 percent level and a 91 percent 
chance at the 1 percent level. 

3.2. Eclipsing binaries in the SMC 

Recently, Pinsonneault & Stanek (2006) have claimed sup- 
port for H s from a sample of 50 eclipsing SB2's of spectral 
types O and B in the Small Magellanic Cloud (SMC). The 
spectroscopic observations were obtained by Harries et al. 



(2003) and Hilditch et al. (2005) who describe their radial 
velocities as being of modest quality, no doubt reflecting 
practical difficulties of their pioneering project. Of these 
50 systems, 15 have q's in the interval (0.84,1.0), and these 
have cr ? 's ranging from 0.013 to 0.12. Thus, compared to 
the Galactic SB2's discussed in Sect. 2, these orbits would 
be assigned to the low precision sample. 

Even if these 15 SMC binaries had precise orbits, the 
sample size is far below the minima derived in Sect. 2.4. 
Nevertheless, in view of the strong claim by Pinsonneault 
& Stanek, the SMC sample has been compared to S2 to see 
if Ho is rejected. The resulting Kuiper statistic is V — 1.24 
and the probability on H that V > 1.24 is p = 0.48. Thus 
the claim that H s is supported by the SMC binaries is not 
even a la result. 

4. Selection effects 

Before concluding that the narrow peak at q ~ 1 is astro- 
physically, as distinct from merely statistically significant, 
we must show that it has not been imprinted on the data 
by selection effects. In Sect. VI of LR79, five selection ef- 
fects were discussed in detail and found not to be capable 
of sharply reducing the discovery probability already at 
q ~ 0.95, the shortward edge of the narrow peak. Three 
further selection effects are now similarly discussed. 

4.1. Biased sampling 

Hogcvccn (1992b) has argued that the positive slope of 
<j)(q), the g-distribution of catalogued SB2's, is entirely 
due to selection effects. More specifically, by constructing 
a synthetic SB catalogue, he claims to demonstrate with a 
model of biased sampling that the montonically decreasing 
pdf 

ip{q) oc q- 2 for q > 0.3 (2) 

that he derives for SBl's is valid also for SB2's - i.e., for 
q k, 0.65. 

To understand and evaluate this remarkable claim, his 
work is here repeated, the only change being a less abstract 
terminology. The basic assumption is that Galactic bina- 
ries have the ^-distribution given by Eq. (2). Hogeveen's 
model for predicting the contents of the SB catalogue is 
then the following: the solar neighbourhood is divided 
into non-overlapping volumes Vi, with observer i being 
awarded exclusive rights to Vi. Each observer's task is to 
discover, monitor and eventually publish orbits for some 
or all of the rij binaries in his assigned search volume. 
The synthetic catalogue then comprises the ensemble of 
published orbits. 

In implementing this model, Hogcvecn (1992b, 
Appendix B) assumes that, independently of rij, observer 
i publishes just one orbit, that of the system closest to 
q = 1. At one extreme, this happens when only one binary 
is discovered and the search procedure stongly favours 
q ~ 1. At the other extreme, all rij binaries are discovered 
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Fig. 4. Probability density functions 4>(q) of biased samples 
according to the model of Hogeveen (1992b). The values of the 
parameter n are indicated. For n = 1, the true pdf is recovered, 
namely tp oc q~ 2 . The dashed curve is Hogeveen's preferred 
solution which has the rn distributed uniformly in the interval 
(1,20). 

but observer i then selects, as 'most suitable' for monitor- 
ing and publication, the system closest to q = 1. 

Fig. 4 shows the resulting biased pdf 's <j>(q) of synthetic 
catalogues. These simulations divide space into 10 6 vol- 
umes Vi so that sampling errors are negligible. The sim- 
plest version of the model sets rij = n, a constant. For 
n = 1, all binaries are discovered and published so that 
0i(<z) — ip(l)y the true distribution. But (j>i(q) is mono- 
tonically decreasing and so is in conflict with the positive 
slope of 4>{q). Fortunately, for n > 1, the assumed bias to- 
wards q — 1 takes effect, and 4> n {q) becomes monotonically 
increasing for n <; 6. Clearly as n — > oo, each observer's 
published orbit has q — > 1, and so 4> n (q) tends to a delta 
function at q = 1. 

Because the slope of <f>{q) - see Fig. 8 in Hogeveen 
(1992b) - is bracketed by the slopes of (f>i(q) and <^>oo(<7)> 
there is some intermediate n ~ 10 that roughly fits the 
data. Moreover, this fit can be improved by exploiting the 
freedom to depart from constant rij. Specifically with the 
rii uniformly distributed in the interval (1,20), Hogeveen 
achieves a fit to <p(q) that is accepted by the K-S test. 

Given this success, Hogeveen's (1992b) work appears 
to stongly support the pdf given in Eq.(2); consequently, 
several groups (Gualandris et al 2005; Ciardullo et al. 
2005; Dionne & Robert 2006) have adopted this pdf 
in their population synthesis codes. Yet others (Pols & 
Marinus 1994; Portegies & Verbunt 1996) cite Hogeveen 
but adopt the somewhat less steeply declining pdf ij){q) = 
2/(1 + q) 2 suggested by Kuipcr (1935). These analytic 
pdf's arc compared in Fig. 5 to the piecewise-linear model 
of Halbwachs et al. (2003). To avoid complexity the ap- 



Fig. 5. Comparison of pdf's. The analytic formulae ip(q) 
proposed by Kuiper(1935) and Hogeveen (1992b) are com- 
pared with the corrected q-distribution given in Fig.7(a) of 
Halbwachs et al. (2003). 



proximately fiat pdf for q > 0.3 found by Mazeh et al. 
(2003) is not included. 

However, the K-S test docs not provide compelling 
support for Hogeveen's ip(q): the sensitivity illustrated in 
Fig. 4 of 4>{q) to the assumptions about m all but guaran- 
tees that a good fit can be achieved. Accordingly this suc- 
cess should not of itself induce belief in Eq.(2), especially 
not if the model and its fitted parameters are an implausi- 
ble description of how spectroscopists conduct themselves. 

The following comments on the basic elements of 
Hogeveen's model suggest that it is indeed implausible: 

1) Of the Hi binaries in Vi, only the system closest to 
q = 1 has its orbit published. 

Hogeveen justifies this bold assumption by supposing 
that an observer, having discovered several SB2's, selects, 
for monitoring and publication, the system whose com- 
ponents have the most nearly equal line depths since an 
accurate orbit is then more readily achieved. But spectro- 
scopists, being aware of the extra information obtainable 
from SB2's, surely do not fail in this way to capitalize on 
their own discoveries. Moreover, they do not set aside diffi- 
cult systems, as solutions for triple and quadruple systems 
demonstrate. 

Alternatively Hogeveen supposes that systems just 
above the detection threshold for SB2's are underrepre- 
sented in the SB catalogue and correspondingly overrepre- 
sented among observers' as yet unreported discoveries. He 
conjectures that such binaries remain longer on observing 
programs so that acceptable accuracy can be achieved for 
K 2 - Fig. 6 tests this conjecture by plotting N 2 , the num- 
ber of velocities measured for the less massive component, 
against q for systems in the 9th catalogue. There is no ev- 
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Fig. 6. Scatter plot of N2, the number of radial velocities mea- 
sured for the less massive component, against q, the mass ratio. 
All binaries from the 9th catalogue (Pourbaix et al. 2004) with 
q > 0.6 are plotted. 
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Fig. 7. Histograms of ^-distributions for Hogeveen's (1992b) 
preferred model, normalized so that catalogued SB's number 
1000. The corresponding total number of binaries in the vol- 
umes Vi is 10,500 and so the number of undiscovered or unre- 
ported systems is 9,500. 



idcnce that observers accumulate more data for systems 
with q <, 0.7. 

2) The rii — 1 binaries in V, not monitored and pub- 
lished by observer i remain unobserved and are thus ex- 
cluded from the SB catalogue. 

This aspect of Hogeveen's model follows from granting 
observer i exclusive rights to Vi. But, in reality, there is no 
such exclusivity, and so an inefficient observer (1/n, <C 1) 
would often be scooped by competitors observing the same 
patch of sky. Accordingly, Hogeveen's reliance on such ob- 
servers to achieve a positively-sloped <p(q) is especially du- 
bious. 

The extent to which Hogeveen's fit to <j)(q) depends 
on excluded binaries is worth noting, as is their q- 
distribution. With rij uniformly distributed in (1,20), the 
average number of binaries in a search volume is 10.5. 
Accordingly, if there are 1000 Vi, then, out of a total 
of 10,500 binaries in T,Vi, the orbits of 1000 are cata- 
logued, leaving 9,500 undiscovered or unreported. The 
^-distributions of these three categories are plotted in 
Fig. 7. We see that, with decreasing q, Hogeveen's preferred 
model implies a steep decline in discovery probability or 
of spectroscopists' inclination to publish. 

Note that 7,750 binaries out of the total of 10,500 are in 
volumes assigned exclusively to inefficient observers with 
1/rii < 0.1 whose activities result in a mere 500 catalogued 
orbits. It is surely not true that such a paucity of effort 
by spectroscopists applies to 73.8 per cent of the binaries 
in the solar neighbourhood. 

Hogeveen's (l,20)-model is an attempt to simulate the 
haphazard manner in which the SB catalogue is assem- 
bled. As such, it has no relevance to the organized and 



disciplined multi-year surveys that underpin the work of 
Halbwachs et al. (2003). Accordingly, there is no possibil- 
ity of invoking biased sampling to overturn their evidence 
in favour of H w nor, in particular, to achieve consistency 
with Eq.(2). 

The above analysis of Hogeveen's (1992b) model indi- 
cates that his treatment of biased sampling does not pro- 
vide convincing support for Eq.(2). Indeed, the extremes 
necessary to achieve a fit to <j)(q) suggest that this pdf 
is strongly excluded. In any case, his work should not be 
interpreted as falsifying either H w or H s - 

4.2. Eclipse-probability bias 

In Sect. 3.2, we saw that the sample of SMC binaries is 
too small and the g's too imprecise to support H s . But 
the possibility remains that the data might support Ti. w , 
since precision is less of an issue in confirming a broad 
peak with FWHM - 0.13 - see Fig.5. 

Binaries in the SMC sample selected themselves by ex- 
hibiting eclipses. This sample is therefore biased in favour 
of high eclipse probabilities, and there is no reason to sup- 
pose that a sample thus biased will be unbiased with re- 
spect to q. Accordingly, ip(q) cannot be determined by 
simply binning the q's of these eB's. 

However, the recent paper by Soderhjelm & Dischler 
(2005) shows how one might proceed in estimating ip(q). 
They have developed a population synthesis code that pre- 
dicts the properties of eB's as a function of eclipse depth 
and orbital period. With the appropriate selection crite- 
ria imposed, the (/-distribution <f>(q) of the SMC eB's could 
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be predicted with this code. The input tj)(q) could then be 
varied until a satisfactory fit is obtained to <j)(q). 

Although in principle possible, this approach is not 
practicable. In order to extract ip(q), we would have to 
input other statistical properties of SMC binaries, specif- 
ically the distributions of their separations and eccentric- 
ities. But these are not well determined even for Galactic 
binaries. 

From Table A.2 of Soderhjelm & Dischler (2005), the 
fraction of OB binaries showing detectable eclipses is 
~ 10~ 3 . To determine statistical properties reliably from 
such a small fraction requires intclligently-contolled sam- 
pling , as in opinion polling, or sampling with accurately 
known bias. Neither of these criteria obtain for the SMC 
cB's. 

The above remarks demonstrate that this sample of 
eB's is not suitable for deriving the statistical properties 
of binaries in the SMC. But such was not among the aims 
of Harries et al. (2003) and Hilditch et al. (2005). Their 
focus is on fundamental parameters in order to investigate 
single- and binary-star evolution at low metallicity and to 
derive the distance to the SMC. Nevertheless, their suc- 
cessful application of a multi-object spectrograph to ob- 
tain numerous SB orbits in the SMC indicate that a survey 
of non-eclipsing massive stars is not infeasible. By discov- 
ering and then monitoring SB's in the SMC (or LMC), 
such a survey could yield useful information on binary 
statistics in an cxtragalactic population. 

Although directed at the SMC sample, these negative 
arguments apply to any sample of eB's. This explains why 
statistical studies of ip(q) have not hitherto been based on 
eB's. The only viable alternative to SB samples is the 
distribution of offsets in colour-magnitude diagrams due 
to binarity. Kahlcr (1999) has analysed such photometric 
binaries in the Pleiades and finds a bimodal g-distribution 
with a broad peak at q ~ 1, thus supporting H w . The 
modest number of offsets and the limited precision of their 
measurement preclude a decisive test of H s . 

4.3. Precision bias 

In order to avoid deconvolution, a high-precision sample 
has been used in Sect. 2 to confirm TL S . But, as the referee 
(J.L.Halbwachs) points out, this selection itself introduces 
a bias that to some degree favours q ~ 1. For an SB2 with 
q 7^ 1, the lines of the secondary are generally the weaker, 
and so we expect that <Jk 2 /K 2 > vid/Ki- Moreover, this 
disparity will increase with decreasing q thus reducing the 
fraction of SB2's meeting a fixed precision criterion such 
as o q < 0.01. 

This effect can be investigated for SB2's in the 9th 
catalogue. From Eq.(l), the fractional contribution of the 
secondary (K 2 > K\) to the variance a q is 



f _ I 



°K 2 

K 2 



Mean e o n i. r i b u I. ions 
of secondaries to ffj 



111"- 




0.85 0.9 

Mass ratio 



Fig. 8. Mean fractional contributions fa of secondaries to the 
variance a q as function of mass ratio q. The error bars denote 
the standard errors of the estimates fa. Primary and secondary 
contribute equally along the dotted line fa = 0.5 . 



and we may then readily show that 



<7*/v/2(l - / 2 ) 



(4) 



0.8, 

70 per cent of the variance cr q , 



(3) 



where cr* is the standard error of q when ax 2 /K 2 = 
a Kl /Ki. 

The variation of f 2 with q is shown in Fig. 8, where the 
binning is the same as for Fig. 2 but extending to smaller 
q. For each bin, the mean f 2 is plotted, with error bars 
indicating its standard error. From this figure, we see that 
for q ^ 0.9, there is a barely significant departure from 
f 2 = 0.5, the line along which both components contribute 
equally to a q . But for q 0.9, the departure is signif- 
icant and in the expected direction. Thus, for q 
secondaries contribute 
corresponding to a q ja* q ~ 1.3 

The implications of this excercise are as follows: if 
ip(q) were being determined over an extended interval of 
q from a precision-limited sample, then this bias would 
be of concern. However, since this effect is inconsequential 
for q <; 0.9, precision-bias cannot account for the peak at 
q > 0.95 in Fig.2. 

In LR79, several selection effects were eliminated as 
explanations of the q > 0.95 peak on the basis of theoret- 
ical modelling. This discussion of precision bias is notable 
as being purely empirical. 

5. Conclusion 

The aim of this paper has been to define and investigate 
the weak and strong versions of the twin hypothesis for bi- 
nary stars. For H w , no evidence has been offered beyond 
that of Halbwachs et al. (2003). Nevertheless, the case for 
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7i w has been greatly strengthened by the re-investigation 
in Sect. 4.1 of Hogeveen's (1992b) model of biased sam- 
pling. Although Hogeveen's work serves as a useful warn- 
ing of how misleading inhomogeneous catalogues can be, 
his success in fitting observational data requires such ex- 
treme assumptions about how spectroscopists carry out 
their research programmes that tp(q) oc q~ 2 is surely ex- 
cluded. Accordingly, population synthesis investigations 
based on the ip(q) of Hogcveen or Kuiper should be re- 
interpreted as exploring the consequences of an unrealistic 
depletion of binaries with qr ~ 1. 

With regard to TL S , a narrow peak comprising sys- 
tems with q ^ 0.95 is found to be highly significant, con- 
firming the work of LR79 and TkOO. Moreover, this peak 
is now evident without recourse to deconvolution proce- 
dures. However, it remains true that support for 7i s de- 
rives from analyses of catalogued systems and not from a 
targeted survey as for 7i w . Unfortunately, it is a major un- 
dertaking to carry out an independent survey large enough 
to meet the minimum sample sizes estimated in Sect. 2.4. 
More feasible would be a programme to re-observe SB2's 
in the low-precision sample S2 of Sect. 2. 2 in order to re- 
duce a q to < 0.01. If H s is true, the narrow peak should 
increase in significance as orbits of low precision are thus 
upgraded. This is a project where orbits obtained with 
small and medium-sized telescopes are a worthwhile con- 
tribution (cf. Tomkin & Fekel 2006). 
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C.D.Scarfe for responding promptly and in detail to queries 
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